The USTC System for Blizzard Challenge 2017

نویسندگان

  • Ya-Jun Hu
  • Chuang Ding
  • Li-Juan Liu
  • Zhen-Hua Ling
  • Li-Rong Dai
چکیده

This paper introduces the details of the speech synthesis system developed by the USTC team for Blizzard Challenge 2017. A 6.5-hour corpus of highly expressive children’s audiobook was released to the participants this year. A parametric system that modeling speech waveforms was built for the task. Firstly, long short term memory (LSTM)-based recurrent neural networks (RNN) were adopted for the baseline system, including tone and breaking indices (ToBI) prediction, duration modeling and acoustic modeling. Then, we proposed a generative adversarial network (GAN) based post-filtering to relieve the oversmoothing in acoustic modeling and compensate for the differences between natural and synthetic spectrum in the baseline system. At last, a WaveNet based neural vocoder was utilized to model speech waveforms from acoustic feature instead of melcepstrum vocoder. The evaluation results show the effectiveness of the submitted system.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

The USTC System for Blizzard Challenge 2009

This paper introduces the USTC’s speech synthesis system for Blizzard Challenge 2009. USTC attended all English tasks including the hub tasks and the spoke tasks. According to the various conditions for different tasks, different versions of HMM based unit-selection systems are constructed based on the USTC Blizzard Challenge 2008 system. Many new techniques are employed in our speech synthesis...

متن کامل

The USTC System for Blizzard Challenge 2011

This paper introduces the speech synthesis system developed by USTC for Blizzard Challenge 2011. USTC attended all the English tasks including a hub task and a spoke task. We developed a hidden Markov model (HMM) based unit selection system for both the tasks. And also some new techniques are employed in our speech synthesis system construction. Results of some internal experiments comparing th...

متن کامل

The USTC System for Blizzard Challenge 2010

This paper introduces the speech synthesis system developed by USTC for Blizzard Challenge 2010. USTC attended all English tasks including the hub tasks and the spoke tasks. According to the various conditions for different tasks, different versions of synthesis systems are constructed. Many new techniques are employed in our speech synthesis system construction. Results of internal experiments...

متن کامل

USTC System for Blizzard Challenge 2006 an Improved HMM-based Speech Synthesis Method

This paper introduces the USTC speech synthesis system for Blizzard Challenge 2006. The HMM-based parametric synthesis approach was adopted for its convenience and effectiveness in building a new voice, especially for the nonnative developers. Some useful techniques were also integrated into our system, such as minimum generation error (MGE) training, phone duration modeling and linear spectral...

متن کامل

The USTC System for Blizzard Challenge 2008

This paper introduces the speech synthesis system developed by USTC for Blizzard Challenge 2008. Two synthetic voices from the released UK English database are built using the HMMbased unit selection synthesis method, which is a hybrid of statistical parametric synthesis and unit-selection techniques. In this method, the optimal sequence of phone-sized candidate units is selected from the datab...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2017